Sequence analysis UProC: tools for ultra-fast protein domain classification

نویسندگان

  • Peter Meinicke
  • Inanc Birol
چکیده

Motivation: With rapidly increasing volumes of biological sequence data the functional analysis of new sequences in terms of similarities to known protein families challenges classical bioinformatics. Results: The ultrafast protein classification (UProC) toolbox implements a novel algorithm (‘Mosaic Matching’) for large-scale sequence analysis. UProC is by three orders of magnitude faster than profile-based methods and in a metagenome simulation study achieved up to 80% higher sensitivity on unassembled 100 bp reads. Availability and implementation: UProC is available as an open-source software at https://github. com/gobics/uproc. Precompiled databases (Pfam) are linked on the UProC homepage: http://uproc. gobics.de/. Contact: [email protected]. Supplementary information: Supplementary data are available at Bioinformatics online.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

UProC: tools for ultra-fast protein domain classification

MOTIVATION With rapidly increasing volumes of biological sequence data the functional analysis of new sequences in terms of similarities to known protein families challenges classical bioinformatics. RESULTS The ultrafast protein classification (UProC) toolbox implements a novel algorithm ('Mosaic Matching') for large-scale sequence analysis. UProC is by three orders of magnitude faster than ...

متن کامل

Allerdictor: fast allergen prediction using text classification techniques

MOTIVATION Accurately identifying and eliminating allergens from biotechnology-derived products are important for human health. From a biomedical research perspective, it is also important to identify allergens in sequenced genomes. Many allergen prediction tools have been developed during the past years. Although these tools have achieved certain levels of specificity, when applied to large-sc...

متن کامل

A Novel Approach for Protein Classification Using Fourier Transform

Discovering new biological knowledge from the highthroughput biological data is a major challenge to bioinformatics today. To address this challenge, we developed a new approach for protein classification. Proteins that are evolutionarilyand thereby functionallyrelated are said to belong to the same classification. Identifying protein classification is of fundamental importance to document the ...

متن کامل

Molecular analysis of AbOmpA type-1 as immunogenic target for therapeutic interventions against MDR Acinetobacter baumannii infection

Introduction: Acinetobacter baumannii is associated with hospital-acquired infections. Outer membrane protein A of A.baumannii (AbOmpA) is a well-characterized virulence factor which has important roles in pathogenesis of this bacterium. Methods: Based on our PCR-sequencing of ompA gene in the clinical isolates, AbOmpA protein can be categorized into two types, named here type-1 and type-2. We ...

متن کامل

Ultra-fast 1-bit comparator using nonlinear photonic crystalbased ring resonators

In this paper, a photonic crystal structure for comparing two bits has beenproposed. This structure includes four resonant rings and some nonlinear rods. Thenonlinear rods used inside the resonant rings were made of a doped glass whose linearand nonlinear refractive indices are 1.4 and 10-14 m2/W, respectively. Using Kerr effect,optical waves are guided toward the correc...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015